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Abstract 

Presented here is the exphcit solution to the continuous time approximation 
of the Albert-Barabasi scale-free network model at any time t. The solution 
is found by recursively solving the differential equations via integrating fac- 
tors, identifying a pattern for the coefficients and then proving this observed 
pattern using induction. 
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1. Introduction 

The Albert-Barabasi scale-free network model has received a great deal 
of attention for well over a decade since the preliminary work, focused on 
determining the mechanism that producing_power-law degree distribution of 
certain real- world networks, was written Work rapidly ensued in the 
application of such network structure to phenomena such as personal prefer- 
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ences, language structure, statistical physics (Bose-Einstein condensatioii), 
epidemics and a veritable wealth of biological examples 
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The intrinsic concept of preferential attachment, previosly called 
cumulative advantage, was first proposed by Price as an application to cita- 
tion networks with a slightly more general model 12|. Despite the decades 



that this concept has persisted in the literature, an explicit solution to the 
degree distribution of such a network has been absent. 

One such method used to describe the dynamic evolution of such a net- 
work is the doubly indexed, by degree and time, difference equation |l3| 

Vk,titi) = ^j—-^Pk-i,t-i{ti) + - 2(t- 1) ) Pk,t-iiU)- 

In this model new nodes are introduced at unit times with a single edge to 
connect preferentially to highly connected existing nodes. The solution to 
;his model was found asymptotically, i.e. as t — )■ oo, by Dorogovtsev et al. 



14| . The continuous time analog was simultaneously solved by Krapivsky 
et al., again in the "most interesting asymptotic regime {t — )■ oo)" [15]. It 
is unclear if Karpivsky and his co-authors found the explicit time t solution 
for this paper specifically. Similar later work by Krapivsky and Redner does 
indeed calculate the explicit solution for the degree distribution for a model 
where rewiring of connections and a more general preferential structure is 



modeled [16|. In this, and a previous paper ( 17|), the author's investigate 
various correlations, e.g. between age of nodes and their connectivity, in 
addition to utilizing rate equations (differential equations) and the generat- 
ing function (partial differential equations) to compute the form and various 
moments of the degree distribution. 



Exact solutions for the robust metrics (e.g. degree distribution, mean 
path length, eigenvalue structure, etc..) are important in the analysis of the 
structure of the networks themselves or phenomena spreading on them. In- 
deed, Dorogovtsev et al. explicitly calculated the path length of a small world 
network structure, 18|, from the discrete model, and Newman et al. did the 



same to the mean field model 



19| . Wang et al. address epidemic spread 



on complex networks without addressing the specific network archetype con- 



sidered 20|. Instead, they analyze the epidemic threshold produced via the 
eigenvalue structure of complex topology. Valente has proposed a technique 
for investigating thresholds of spreads on social networks (either established 
or as they form) based on the type of adoption employed in the diffusion 



of innovation by individuals j2l|. While he does not explicitly address the 
network topology itself, it is clear that a need for the distribution on the 
number of connections, as well as their "quality", at any time during the 
spread is important to understanding the resultant diffusion. 

Presented here is something of a side-generalization of the scale-free model 
that was previously solved asymptotically by Krapivsky et al.., again explic- 
itly by Krapivsky and Redner and in other approximate forms by Barabasi, 



Albert and Jeong (|22l|): 



Nk{t) = A5k,m + Am [nk-i{t)Nk-iit){l - 5k,m) - Mt)Nkit)] ■ (1) 

I allow new nodes to enter the network at constant rate A, and connect 
to m existing nodes with density dependent preference TCkit) = -j^, where 
D{t) = 2A'mt + Dq is twice the count of edges within the network with Dq/2 



initial edges. This is the so-called rate equation(s) for the counts of nodes 
with degree k. The derivation of such an equation is well handled in a num- 



ber of review papers 
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25|. When considering the previously solved 



model equivalence is obtained when there is no rewiring (r = 0), the ker- 
nel is strictly the Albert-Barabasi preferential attachment (A = 0), both the 
node introduction rate and minimum degree are unit (A = m = 1) and when 
the initial condition is described by two nodes connected with a single edge 
{Nq = Dq = 2 and A^(0) = 25^ m). From this description one can see that 
the model considered here is dynamically less-rich (the process of rewiring 
and of more general attachment is excluded) but considers a more general 
"introduction regime" (the newly introduced nodes are a bit more general). 



A common problem with the application of scale-free networks is that 
while it does match the form of many established structures, i.e. the World 
Wide Web ([26.]), it does not properly match the evolution of such structures 
{2?! . The goal here is to produce explicit, time t, solutions which may be 
used in addressing questions which take place during the evolution of such 
network, e.g. synchronicity or "worst-case" spread of disease on networks as 



considered in 



28| and ^\ respectively. The ability to compare these time 



t solutions with specific application data may allow for researchers to more 
accurately propose better models for the phenomena in question. 



In Section [2] I will go over the general recursive form of the solution found 
via integrating factors. A few base cases, for k = m, m + 1, and m + 2 are 
worked out in Section [3l and a proof by induction of the recognized form 
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is given in Section HJ The paper ends with a discussion of the decay of 
dependence on initial conditions and a few "tweaks" to and critiques of the 
model are suggested that may lead to better approximations. 



2. General Solution Form 

For the sake of space we define the "double-count" of all edges in the 
network at time t hj D{t) := 2 Ami + Dq, the total number of nodes as 
N(t) := At + A^o, and let Rk(t) := 7ik(t)Am. Further note that 

el = D{t)^ =:Hk{t). 

The general solution to Nk{t) can thus be given recursively as 

Nk{t) + Rk{t) = ASk,m + Rk-l{t)Nk-l{t){l-6k,m), (2) 

[Nk{t)Hkit)] = {ASk,„, + Rk-i{t)Nk-i{t){l-5k,rr.))Hkit), (3) 

and finally 

, . ^ / [AHkit)6k,m + Rk-l{t)Nk-l{t)Hkit){l - 5k,rrd] dt + Ck 

''^ ' Hk{t) ' ^ ' 

H2{t) ^ , j Rk-i{t)Nk-i{t)Hk{t){l-5k,m)dt + Ck 

- Ok,ra + - TrTT^ ■ > (5) 



m{k + 2) Hk{t) 
where the final step follows from the two observations (when k ^ —2) 



Hk{t)dt = j\2Kmt + Do)^dt, 



k + 2 



{2Amt + Do)— Hk+2{t) 



Am{k + 2) Am{k + 2) 



and 



(6) 



Hk+n{t) {2Amt + Do) — 

TT u\ = ~ 7^ = ^n[t). [7) 

^k{t) {2Amt + Dop 



The constants of integration, C^, are defined by 

^2(0) ^ jRk-i{t)Nk-i{t)Hkmi-6k,m)dt + Ck 
m[k + 2) Hk(t) 



These expressions have the general forms (proven later by induction) 



k 



{m + l)V{k)H 2{t) ^ ^ fk-l\ Q 
T{k + 3) 

with the constants of integration 

(_i)i-m+ir(i)i7, 

T{i-m + l)T{m + l){i + 2) 



The typically assumed initial conditions are A^(0) = m + 1 and -D(O) = 2m 
(i.e. A''fc(0) = 6k,m)- These expressions for Nk{t) and Ci condense to a closed 
form solution in terms of a hypergeometric function, 2F1, (presented without 



derivation). Defining G{t) := |^ = + 1 the solution to Nk{t) is 



^ (m+l)r(fc)iJ,(t) Akjt) (G(t) - i)^-r(fc) 
^ r(A; + 3) G""(t) G"=(t)r(A;-m + l)' ^ ' 



where 



r(^)2-^i + 2, m - A;; m + 3; 
^''^^^ " (m + 2)r(m+l)r(fc-m + l) ' 
It is relatively straightforward to verify this result against that of Krapivsky 
and Redner (j3l) using Equations [8] and [9] with m = 1, Hk{t) = (2t + 2)t, 
and Ne{0) = 26e^m to arrive at 



4(« + l) , 1 r(*) (-iF(2j+4) 



/c(/c + l)(/c + 2) (t + 1)1 ^ r(A; - j) ^■!(^- + 3)(t + l)f 
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3. Base Cases 

In order to prove the previous solution expressions I employ a recursive/ 
inductive method. The solution for the smallest degree, m, is calculated 
rather easily on its own with an integrating factor. The degrees k > m 
require the solution to the previous degree (recursion). A pattern is found 
through inspection and then showed to hold in general in the inductive step in 
Section m A "trick" is highlighted in the case of Nm+2(t) which is employed 
to preserve the pattern. 

3.1. Nm+j{t), J = 0,1,2 

In this Subsection I consider only the form of the state variable solution. 
The form constants of integration are treated in the following Subsection. 



3.1.1. J = 

The expression for Nm{t) is given by 

Nm{t) = A - Rm{t)N^{t) = A - Am7r„(t)iV„(t). 

The only additive term is the constant node introduction rate A because each 
new node enters with degree m and thus there may not exist a node of degree 
less than m to "feed into" Nm{t)- The solution follows straightforwardly from 
Equation O 

A r H^(t)dt + 
Nm(t) = — — ^^^^^7- -, 12 

A Hm+2{t) I ^ 
_ ^^Am(m+2) ^ ^^^^ 

^ -m) + (14) 



m(m + 2) ^' H^{t)' 
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3.1.2. j = 1 

The solution to Nm+i(t) depends on Nm{t), since a new node connecting 
to a node of degree m creates a node of degree m + l. Thus, again from 
Equation we have a solution in the form: 

Am J Tlm{t)Nmit)Hm+l{t)dt + Cm+i 
l\m+l{t) - — , [ib) 

^ ^^IW + ^) Hm^^(t)dt + G^^, ^^^^ 

Considering just the integral we may employ the properties Equation |6] and 
[7] to write it as 

Hn.-,iit) ^ mOA = ^ +^H,it). (17) 



m + 2 Hiit) J Am(m + 2)(m + 3) A 

Plugging Equation [T7] into [I6] results in 

(m + 2)(m + 3) Hm{t) Hm+i{t) 

3.1.3. j = 2 

The solution to Nm+2{t) follows in essentially identical fashion to that of 
Nm+i{t) save for one "tricky" step. Within the integral term, as found in 
Equation [20} the term m{m + l)Cm is written as m{m + l)CmHi-i{t)- This 
will not alter the resultant value, but it does keep the form of Cm+i consistent. 
This way of multiplying by 1 has to be employed for each consecutive ODE. 
The solution via Equation [5] is 

_ Amj7r„r+i{t)Nm+l{t)Hm+2{t)dt + Cm+2 
^Vm+2ltj - 77 7-T , liyj 

^ / ( t^ll'^.ZXf +mim + l)C^H,_, (t) + ^^^^±^) dt + C^^^, ^ 

Hm+2it) 
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This produces the solution 

nr _ jm+l) (m + l)mC^ (m + l)Cn,+i Cm+2 . s 

3.1.4. The Pattern 

Inferring the pattern at this point is perhaps a touch premature. Finding 
the solution to A^m+3(^) niakes the formulae evident, but for the sake of space 
I leave this to the reader and outline the pattern here using results from the 
Nm+sit) solution. Each solution is composed of essentially two parts: a part 
that involves the term H2{t) and another that involves the constants of in- 
tegration. 



H2{t):. The coefficients for the iJ2(^)-term follow for j = 0, 1, 2, and 3: 

1 1 (m+l) 1 (m+l) rpi 

TO(m+2) ' (m+2)(m+3) ' (m+4)(m+3)(m+2) ' (m+5)(m+4)(m+3) ' ^ prOgreSSlOU IS 

clear when the first two terms are multiplied and divided by m + 1 to give 
the general coefficient of the H2{t) term for Ni^{t) of 

^i±i . (22) 

Cm+i-'- The coefficients for the Cm+j-term are each divided by the appropriate 
Hm+iit) term. Furthermore, as is clearer from the solution of Nm+sit) there 
is also a term in the form of (^^i^) giving a final form for the coefficient of 

( ^ ' (23) 

\m + t-lj H^+,{ty ' 

Putting the two patterns together I infer a solution identical to Equation |8l 
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3.2. Cm+i) ^ — 0,1,2 

The constants of integration are found rather straightforwardly as 

= fiV„(0) - V„(0), 
\ m[m + 2,) J 

Cm+i = fiV„^+i(0)-miV:^(0) + ^^^)if„+i(0), 

\ I 2[m + 4) J 

The sign before the i^2(0) terms is alternating and with a coefficient of 
("^'"') mimli+2) for The coefficient for the iV^+,(0) terms is i-iy-'Cliz\) 

for £ = 0, 1, i. This gives an expression identical to Equation [91 

4. Inductive Step 

To prove that these are indeed the solutions for all m + j, j = 0, 1, 2, ... 
I will invoke induction. Suppose that for some j > 2 it is true that 

r(m + , + 3) .-1 

By Equation [5] we have that 

_ Am / 7r„+j(t)A/'„+j(t)if^+j+i(t)rft + C„+j+i 
7Vm+j+Utj 77 7-T . 

Considering just the integral, a few cancellations gives the form 

(m + l)r(m + j + l) f ^ , .s^V^ + J-l\^ ,,,,, 

r(m + j + 3) y 5^ jaj H^+,^,^,{t)dt. 

By using the properties of H in Equations |6] and [7] this resolves to 
(m + l)r(m + 3 + 1) rj m , /"^ + A 



Amr(m + j + 4) 
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(m+l)r(m + j + l) vY^+A , 



m+j+1 



Placing this back into the general solution gives 

m+j 

i-ljHi{t) ' H^+j+i{t)' 

(25) 

which is identical in form to Equation |5l Thus, the general formula for 
Nk{t) is correct. Verifying Ci follows straightforwardly from this proof. The 
derivation of the single formula in terms of the hypergeometric function as in 
Equation [5] is left to the reader and easily verified with a computer algebra 
system. 

5. Conclusion/ Discussion 

One may view the solution given in Equation |8] as containing two parts: 
the core dynamics and the initial conditions. The initial conditions are con- 
tained in the series 

E 



i-l)H,{ty 



or perhaps more illustratively in 



i=m 

It is clear here that for the quantity N};{t) the dependence on initial condi- 
tions generated by all Nj{t) with j less than some i decays on the order of 
0{t~h). 

We may also consider the proportion of nodes that have degree /c, 

and find that this both asymptotically agrees with previous findings of the 
degree distribution which do not consider transitory behavior and with those 
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found by Krapivsky and Redner under appropriate restrictions. Also, of 

mathematical interest, I've inadvertently deduced a seemingly novel identity 

for hypergeometric functions. Given that N{t) = At + Nq = J2T=m ^k(t) one 

may conclude for Nq = m + 1, Dq = 2m, and A''fc(0) = 6k,m that 

. ,m r(m) — m ■^r(?+m) / 1 \ 

(1 + Atp- = > \ , 2Fiim + 2, -?; m + 3- 

^ ' (m + 2)r(m + i) ^r(j + i)' 'V ' 'v/AtTIy 



Direct application of this time t solution should be useful for two reasons: 
formation of and dynamics on complex networks. While the exact rules that 
particular real-world networks observe in their formation is an intractable 
key to their structure, one may compare such structures (data) with the so- 
lution given here to justify if this mechanism is applicable. For dynamics 
on non-limiting case networks (e.g. diseases through a smaller population, 
spread of computer viruses on local networks, ideas spreading through form- 
ing social cliques, etc..) this solution may also be employed to describe the 
topology of such structures. Results such as those found by Zhao et. al, on 
the fragility or scale-free networks to attacks on hubs, should be extended 
to include networks at are evolving over time j29]. It is possible, that since 
at time t there is a smaller probability of hubs than in the limiting network 
(i.e. the tail is not fat enough yet) the network is in fact more resilient to 
cascading failures that the seemingly more robust limiting network. 

There are a few flaws with considering these rate equations as a model 
for the A-B network growth. In an arbitrarily small time interval from t = 
there has been mass shifted into compartments, Nk{t), with arbitrarily large 
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degree. Indeed, the process only allows for [2AmtJ + Dq edges at any time t, 
and thus there should be a maximum degree available at any time. Pastor- 
Satorras and Vespignani do this to some extent for general scale-free networks 
with the introduction of exponentially decaying tails induced on the degree 
distribution j30| . This was done in the interest of investigating the epidemic 
threshold for finite sized networks in order to find structures with noncrit- 
ical spread despite the scale-free property. This method however would be 
inadequate for an investigation into the structure of the networks as they 
evolve since the networks were taken to be static in their work. Instead the 
"cut-ofF' could be modeled implicitly with some sort of modified system that 
permits the correct number of ODEs to be acting at any time t. A delay 
type differential equation system containing delays similar to t — [ ^^" J and 
initial conditions like Nk{t) = for t G [— [ ^^° J , O) would perhaps do a 
better job at modeling the transitory dynamics of this system as they posses 
this sort of activation switch, i.e. they may only start to accept "mass" once 
the delay exits the initial data. This would force a finiteness that is otherwise 
lost in the traditional rate equation techniques. 

Further work is also open in the gross generalization of this process. One 
may consider the degree of a new node to be a distributed parameter. In 
other words the degree a node is introduced with is some random variable 
M with probability mass function pm- The concept of rewiring may be 
reintroduced to the model in addition to a suite of more general attachment 
kernels. Under these generalizations it is very likely that the network may not 
retain an asymptotic distribution of P{k) oc A;"''', especially if the attachment 
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kernel differs from proportional. However, the true power of the A-B model 
should be in its flexibility to allow for generalizations that produces richer 
behavior and thus, farther reaching applications. 
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